Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full-text articles may not yet be available without charge during the embargo (administrative interval).
Some links on this page may take you to non-federal websites. Their policies may differ from those of this site.
- Deepfake speech represents a real and growing threat to systems and society. Many detectors have been created to aid in defense against speech deepfakes. While these detectors implement myriad methodologies, many rely on low-level fragments of the speech generation process. We hypothesize that breath, a higher-level part of speech, is a key component of natural speech, and thus that its improper generation in deepfake speech is a performant discriminator. To evaluate this, we create a breath detector and leverage it against a custom dataset of online news article audio to discriminate between real and deepfake speech. Additionally, we make this custom dataset publicly available to facilitate comparison for future work. Applying our simple breath detector as a deepfake speech discriminator on in-the-wild samples allows for accurate classification (perfect 1.0 AUPRC and 0.0 EER on test data) across 33.6 hours of audio. We compare our model with the state-of-the-art SSL-wav2vec and Codecfake models and show that these complex deep learning models either completely fail to classify the same in-the-wild samples (0.72 AUPRC and 0.89 EER) or substantially lag the computational and temporal performance of our methodology (37 seconds to predict a one-minute sample with Codecfake vs. 0.3 seconds with our model). (A sketch of the AUPRC and EER metrics appears after this list.)
  Free, publicly-accessible full text available September 1, 2026.
- Sharing high-quality research data specifically for reuse in future work helps the scientific community progress by enabling researchers to build upon existing work and explore new research questions without duplicating data collection efforts. Because current discussions about research artifacts in Computer Security focus on reproducibility and availability of source code, the reusability of data is unclear. We examine data sharing practices in Computer Security and Measurement to provide resources and recommendations for sharing reusable data. Our study covers five years (2019–2023) and seven conferences in Computer Security and Measurement, identifying 948 papers that create a dataset as one of their contributions. We analyze the 265 accessible datasets, evaluating their understandability and level of reuse. Our findings reveal inconsistent practices in data sharing structure and documentation, causing some datasets to not be shared effectively. Additionally, reuse of datasets is low, especially in fields where the nature of the data does not lend itself to reuse. Based on our findings, we offer data-driven recommendations and resources for improving data sharing practices in our community. Furthermore, we encourage authors to be intentional about their data sharing goals and to align their sharing strategies with those goals.
  Free, publicly-accessible full text available May 12, 2026.
- The Heilmeier Catechism consists of a set of questions that researchers and practitioners can consider when formulating research and applied engineering projects. In this article, we suggest explicitly asking who is included and who is left out of consideration.
  Free, publicly-accessible full text available May 1, 2026.
- A rapidly emerging research community at the intersection of sport and human-computer interaction (SportsHCI) explores how technology can support physically active humans, such as athletes. At highly competitive levels, coaching staff play a central role in the athlete experience by using data to enhance performance, reduce injuries, and foster team success. However, little is known about the practices and needs of these coaching staff. We conducted five focus groups with 17 collegiate coaching staff across three women's teams and two men's teams at an elite U.S. university. Our findings show that coaching staff selectively use data to balance performance goals, athlete emotional well-being, and privacy. This paper contributes design recommendations to support coaching staff in operating across the data life cycle as they gather, share, decide on, act on, and assess data while aiming to support team success and foster the well-being of student-athletes.
  Free, publicly-accessible full text available April 25, 2026.
- Cochlear implants (CIs) allow deaf and hard-of-hearing individuals to use audio devices, such as phones or voice assistants. However, the advent of increasingly sophisticated synthetic audio (i.e., deepfakes) potentially threatens these users. Yet, this population's susceptibility to such attacks is unclear. In this paper, we perform the first study of the impact of audio deepfakes on CI populations. We examine the use of CI-simulated audio within deepfake detectors. Based on these results, we conduct a user study with 35 CI users and 87 hearing persons (HPs) to determine differences in how CI users perceive deepfake audio. We show that CI users can, similarly to HPs, identify text-to-speech generated deepfakes. Yet, they perform substantially worse on voice conversion deepfake generation algorithms, achieving only 67% correct audio classification. We also evaluate how detection models trained on CI-simulated audio compare to CI users and investigate whether they can effectively act as proxies for CI users. This work begins an investigation into the intersection between adversarial audio and CI users to identify and mitigate threats against this marginalized group. (A sketch of one common CI-simulation technique appears after this list.)
  Free, publicly-accessible full text available January 1, 2026.
- IMSI-Catchers allow parties other than cellular network providers to covertly track mobile device users. While the research community has developed many tools to combat this problem, current solutions focus on correlated behavior and are therefore subject to substantial false classifications. In this paper, we present a standards-driven methodology that focuses on the messages an IMSI-Catcher must use to cause mobile devices to provide their permanent identifiers. That is, our approach focuses on causal attributes rather than correlated ones. We systematically analyze message flows that would lead to IMSI exposure (most of which have not been previously considered in the research community) and identify 53 messages an IMSI-Catcher can use for its attack. We then perform a measurement study on two continents to characterize the rate at which connections use these messages in normal operations. We use these benchmarks to compare against open-source IMSI-Catcher implementations and then observe anomalous behavior at a large-scale event with significant media attention. Our analysis strongly implies the presence of an IMSI-Catcher at said public event (p << 0.005), thus representing the first publication to provide evidence of the statistical significance of its findings. (A sketch of this kind of rate comparison appears after this list.)
  Free, publicly-accessible full text available January 1, 2026.
- Audio deepfakes represent a rising threat to trust in our daily communications. In response, the research community has developed a wide array of detection techniques aimed at preventing such attacks from deceiving users. Unfortunately, the creation of these defenses has generally overlooked the most important element of the system: the user themselves. As such, it is not clear whether current mechanisms augment, hinder, or simply contradict human classification of deepfakes. In this paper, we perform the first large-scale user study on deepfake detection. We recruit over 1,200 users and present them with samples from the three most widely cited deepfake datasets. We then quantitatively compare performance and qualitatively conduct thematic analysis to motivate and understand the reasoning behind user decisions and differences from machine classifications. Our results show that users correctly classify human audio at significantly higher rates than machine learning models and rely on linguistic features and intuition when performing classification. However, users are also regularly misled by preconceptions about the capabilities of generated audio (e.g., that accents and background sounds are indicative of humans). Finally, machine learning models suffer from significantly higher false positive rates and experience false negatives that humans correctly classify when issues of quality or robotic characteristics are reported. By analyzing user behavior across multiple deepfake datasets, our study demonstrates the need to more tightly compare user and machine learning performance, and to target the latter toward areas where humans are less likely to successfully identify threats.
  Free, publicly-accessible full text available December 2, 2025.
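For the breath-detection entry above, the reported AUPRC (area under the precision-recall curve) and EER (equal error rate) can be computed from detector scores as in the following minimal sketch. The labels, scores, and variable names here are illustrative assumptions, not data from the paper.

```python
# Minimal sketch of the AUPRC and EER metrics reported in the
# breath-detection entry. Labels and scores are invented for illustration.
import numpy as np
from sklearn.metrics import average_precision_score, roc_curve

y_true = np.array([1, 1, 0, 0, 1, 0])               # 1 = deepfake, 0 = real (hypothetical)
y_score = np.array([0.9, 0.8, 0.3, 0.1, 0.7, 0.4])  # detector scores (hypothetical)

# AUPRC: area under the precision-recall curve (1.0 is a perfect detector).
auprc = average_precision_score(y_true, y_score)

# EER: the operating point where the false positive rate equals the
# false negative rate (0.0 is a perfect detector).
fpr, tpr, _ = roc_curve(y_true, y_score)
fnr = 1 - tpr
eer = fpr[np.nanargmin(np.abs(fnr - fpr))]

print(f"AUPRC = {auprc:.2f}, EER = {eer:.2f}")
```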
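The cochlear implant entry does not specify how its CI-simulated audio is produced; a common approach in the CI literature is a noise-excited envelope (channel) vocoder. The sketch below follows that standard technique; the channel count, frequency range, and envelope cutoff are illustrative assumptions, not the paper's configuration.

```python
# Minimal sketch of a noise-excited channel vocoder, a standard way to
# simulate cochlear-implant (CI) hearing. All parameters are illustrative
# assumptions, not the configuration used in the paper.
import numpy as np
from scipy.signal import butter, sosfiltfilt

def ci_simulate(audio, fs, n_channels=8, f_lo=100.0, f_hi=8000.0, env_cut=160.0):
    # Assumes fs > 2 * f_hi so the top band stays below the Nyquist frequency.
    edges = np.geomspace(f_lo, f_hi, n_channels + 1)  # log-spaced band edges
    env_sos = butter(2, env_cut, btype="low", fs=fs, output="sos")
    out = np.zeros(len(audio))
    for lo, hi in zip(edges[:-1], edges[1:]):
        band_sos = butter(4, [lo, hi], btype="bandpass", fs=fs, output="sos")
        band = sosfiltfilt(band_sos, audio)
        env = sosfiltfilt(env_sos, np.abs(band))       # rectify + low-pass = envelope
        noise = sosfiltfilt(band_sos, np.random.randn(len(audio)))
        out += env * noise                             # re-excite envelope with noise
    return out / (np.max(np.abs(out)) + 1e-9)          # normalize to roughly [-1, 1]
```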
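The IMSI-Catcher entry reports statistical significance (p << 0.005) without naming the test in the abstract. One simple way to frame the benchmark-versus-event comparison is a one-sided binomial test on the fraction of connections that see an identity request; the counts and benchmark rate below are invented for illustration and are not the paper's data or necessarily its test.

```python
# Illustrative sketch: compare the rate of identity-requesting messages
# observed at an event against a benchmark rate from normal operations.
# All numbers are invented; the paper's actual data and test may differ.
from scipy.stats import binomtest  # requires scipy >= 1.7

baseline_rate = 0.02      # hypothetical benchmark: 2% of connections see an identity request
event_requests = 120      # hypothetical identity requests observed at the event
event_connections = 1000  # hypothetical total connections observed at the event

result = binomtest(event_requests, event_connections,
                   p=baseline_rate, alternative="greater")
print(f"observed rate = {event_requests / event_connections:.3f}, "
      f"p = {result.pvalue:.2e}")
```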